Maximum-throughput mapping of SDFGs on multi-core SoC platforms
Abstract
Data-flow models are attracting renewed attention because they lend themselves to efficient mapping on multi-core architectures. The key problem of finding a maximum-throughput allocation and scheduling of Synchronous Data-Flow graphs (SDFGs) onto a multi-core architecture is NP-hard and has traditionally been solved by means of heuristic (incomplete) algorithms with no guarantee of global optimality. In this paper we propose an exact (complete) algorithm for the computation of a maximum-throughput mapping of applications specified as SDFGs onto multi-core architectures. This is, to the best of our knowledge, the first complete algorithm for generic SDF graphs, including those with loops and a finite iteration bound. Our approach is based on Constraint Programming; it guarantees optimality and can handle realistic instances in terms of size and complexity. Extensive experiments on a large number of SDFGs demonstrate that our approach is effective and robust. © 2013 Elsevier Inc. All rights reserved.
Related resources
Predictable mapping of streaming applications on multiprocessors
The design of new consumer electronics devices is getting more and more complex as more functionality is integrated into these devices. To manage the design complexity, a predictable design flow is needed. The result should be a system that guarantees that an application can perform its own tasks within strict timing deadlines, in...
Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems
Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of scheduling hardware resources with respect to thread concurrency. In this paper, to resolve this problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...
Synergy: A HW/SW Framework for High Throughput CNNs on Embedded Heterogeneous SoC
Convolutional Neural Networks (CNNs) have been widely deployed in diverse application domains. There has been significant progress in accelerating both their training and inference using high-performance GPUs, FPGAs, and custom ASICs for datacenter-scale environments. The recent proliferation of mobile and IoT devices has necessitated real-time, energy-efficient deep neural network inference on...
Predictable SoC architecture based on COTS multi-core
With the increasing complexity of real-time embedded applications and the availability of Commercial-Off-The-Shelf (COTS) multi-cores, time-predictable execution on these platforms has become a necessity. However, there are several challenges to achieving this predictability, primarily arising due to hardware resources shared between the cores (memory controllers, caches and shared interconnect)...
11.5 Memory Resource Allocation and Scheduling in MPSoC
With the availability of advanced MPSoC and emerging Dynamic RAM (DRAM) interface technologies, an optimal allocation of logical data buffers to physical memory cannot be handled manually anymore due to the huge design space. An allocation does not only need to decide between an on- or off-chip memory, but also needs to take an increasing number of available memory channels, different bandwidth c...
Journal title:
- J. Parallel Distrib. Comput.
Volume 73
Publication date: 2013